Rank in Wordlist | Frequency | Word |
---|---|---|
4725 | 11 | 1,000 |
7498 | 7 | 2,5 |
8699 | 6 | 2,000 |
10269 | 5 | 1,5 |
10312 | 5 | 3,5 |
12566 | 4 | 1,100 |
12567 | 4 | 1,4 |
12634 | 4 | 20,000 |
12645 | 4 | 3,000 |
12646 | 4 | 3,500 |
Rank in Wordlist | Frequency | Word |
---|---|---|
172 | 181 | suradnik(ci |
36194 | 1 | 0500 (YEKT)/+0600 (YEKST |
36309 | 1 | 101.07(2 |
36394 | 1 | 118.710(7 |
36592 | 1 | 150.36(2 |
36598 | 1 | 151.964(1 |
36733 | 1 | 174.967(1 |
36908 | 1 | 1931.(Priznanje |
36936 | 1 | 195.078(2 |
37071 | 1 | 2-3(-6 |
Rank in Wordlist | Frequency | Word |
---|---|---|
22269 | 2 | -kaiyaah).- |
36144 | 1 | -reⁿ).- |
36169 | 1 | 0,82)/2 |
36194 | 1 | 0500 (YEKT)/+0600 (YEKST |
36576 | 1 | 149) »Samo |
36924 | 1 | 1944-1945).U |
37146 | 1 | 2006.)[4 |
37154 | 1 | 2007;46(1):25-29 |
37239 | 1 | 25.000)koji |
37313 | 1 | 2r)2 |
Rank in Wordlist | Frequency | Word |
---|---|---|
22283 | 2 | 100%-tni |
37114 | 1 | 20%-30 |
Rank in Wordlist | Frequency | Word |
---|---|---|
42074 | 1 | D&D |
50133 | 1 | M&B |
55950 | 1 | R&A |
55951 | 1 | R&B |
72700 | 1 | km² |
88963 | 1 | silver&gold |
90414 | 1 | stanovnika/km&up2 |
Rank in Wordlist | Frequency | Word |
---|---|---|
40504 | 1 | Borik"je |
41806 | 1 | Committee"-a |
50426 | 1 | Mama","I |
54535 | 1 | Planirajuće"ili |
57067 | 1 | Saint-Genet"… |
57620 | 1 | Shakur("Dear |
65200 | 1 | cijanovodik("prusku |
68843 | 1 | gorjeti" |
71633 | 1 | je:"Trgovče |
71634 | 1 | je:"Zar |
Rank in Wordlist | Frequency | Word |
---|---|---|
36194 | 1 | 0500 (YEKT)/+0600 (YEKST |
36203 | 1 | 1+R |
37615 | 1 | 5+1 |
41058 | 1 | C+I+G |
47413 | 1 | Juan+Esteban |
53677 | 1 | PG1247+26 |
55954 | 1 | R0+—nenegativnih |
82790 | 1 | pola+1 |
87388 | 1 | recesija+inflacija |
90347 | 1 | stagnacija+inflacija |
Rank in Wordlist | Frequency | Word |
---|---|---|
38339 | 1 | Ain*t |
98147 | 1 | σ*2 |
Rank in Wordlist | Frequency | Word |
---|---|---|
4874 | 11 | i/ili |
11256 | 5 | km/h |
12176 | 5 | st./km2 |
12637 | 4 | 2006/07 |
13122 | 4 | McCarty/Bonney |
16061 | 3 | 1/3 |
16068 | 3 | 108/00 |
16159 | 3 | 2/3 |
16160 | 3 | 2004./05 |
16161 | 3 | 2004/2005 |
Rank in Wordlist | Frequency | Word |
---|---|---|
36344 | 1 | 10=9-1 |
38294 | 1 | Aglaia=ljepota |
43181 | 1 | ESPERI=nadati |
43698 | 1 | Euphrosyna=radost |
47561 | 1 | KE=da |
51853 | 1 | NA=6,022·1023 |
51884 | 1 | NOKTO=noć |
56936 | 1 | SEBE=Li |
59219 | 1 | TAGO=dan |
59643 | 1 | Thalia=zadovoljstvo |
Rank in Wordlist | Frequency | Word |
---|---|---|
55894 | 1 | Quelle@@hr/b/r/i/Britney_Spears_3573.html |
55896 | 1 | Quelle@@hr/f/r/a/Francuski_Antarktik_9fb4.html |
55897 | 1 | Quelle@@hr/f/r/a/Frank_Sinatra_d0f7.html |
55899 | 1 | Quelle@@hr/k/a/n/Kanadska_ženska_bendijska_reprezentacija.html |
55901 | 1 | Quelle@@hr/o/l/i/Olimpijske_igre.html |
55903 | 1 | Quelle@@hr/p/r/o/Product_placement.html |
55907 | 1 | Quelle@@hr/s/b/_/Sb_reprezenta.html |
55910 | 1 | Quelle@@hr/s/o/h/SOHO_b266.html |
55913 | 1 | Quelle@@hr/s/t/r/Struktura_Zemlje_7d05.html |
55915 | 1 | Quelle@@hr/t/v/_/TV_Dalmacija_f25b.html |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing forbidden characters do not have a very low frequency, there may be a problem in preprocessing.
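The check described above can be sketched in a few lines of Python. This is only an illustration: the function name `flag_suspicious`, the `allowed` set, and the frequency threshold `min_freq=5` are assumptions made for this example and are not part of the original analysis. Which characters count as allowed has to be decided per language.

```python
# Characters checked in this subsection (taken from the list in the text).
SPECIAL_CHARS = set(",()%&$\"'+*=/_")


def flag_suspicious(wordlist, allowed=frozenset(), min_freq=5):
    """Return (word, freq, offending_chars) for every word that contains a
    special character not in `allowed` and occurs at least `min_freq` times.

    `allowed` and `min_freq` are hypothetical parameters chosen for this sketch.
    """
    forbidden = SPECIAL_CHARS - set(allowed)
    flagged = []
    for word, freq in wordlist:
        hits = sorted(forbidden & set(word))
        if hits and freq >= min_freq:
            flagged.append((word, freq, hits))
    return flagged


# A few (word, frequency) pairs copied from the tables in this section.
sample = [
    ("suradnik(ci", 181),
    ("i/ili", 11),
    ("km/h", 5),
    ("D&D", 1),
    ("Borik\"je", 1),
]

# Here "/" is treated as allowed inside words (an assumption for the example).
for word, freq, chars in flag_suspicious(sample, allowed={"/"}):
    print(word, freq, chars)
```

With "/" treated as allowed, only `suradnik(ci` is reported: it is the one sample word that combines a forbidden character with a frequency well above the threshold, which is the kind of entry that may point to a preprocessing problem.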
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
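The same query pattern can be applied to the other characters, but `%` and `_` are wildcards in SQL `LIKE` and have to be escaped. The following is a minimal sketch that uses Python's built-in `sqlite3` module in place of the original database. The helper names `like_pattern` and `words_containing`, the explicit `ESCAPE` clause, the added `ORDER BY`, and the in-memory sample table are assumptions for illustration. The sample ids are the ranks from the tables above plus 100, mirroring the `w_id-100` offset in the query.

```python
import sqlite3


def like_pattern(char):
    """Build a LIKE pattern that matches any word containing `char`.

    `%` and `_` are LIKE wildcards, so they (and the escape character
    itself) are prefixed with a backslash.
    """
    if char in "%_\\":
        char = "\\" + char
    return "%" + char + "%"


def words_containing(conn, char, limit=10):
    # Same shape as the query above: only w_id > 100, rank shown as w_id - 100.
    # ORDER BY is added here so the result order is deterministic.
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(char), limit)).fetchall()


# Hypothetical in-memory table holding a few rows from the tables above.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words (w_id, word, freq) VALUES (?, ?, ?)",
    [
        (11356, "km/h", 5),
        (22383, "100%-tni", 2),
        (37214, "20%-30", 1),
        (56001, "Quelle@@hr/o/l/i/Olimpijske_igre.html", 1),
    ],
)

for char in "%_/":
    print(char, words_containing(conn, char))
```

Without the escaping step, the pattern for `_` would match every word, since an unescaped `_` stands for any single character.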
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots